Quantitative Structure-Electrochemistry Relationship Study of Some Organic Compounds Using PC-ANN and PCR
Abstract
Internet Electron. J. Mol. Des. 2003, 1, 000–000

Motivation. A QSPR analysis was conducted on the half-wave reduction potential (E1/2) of a diverse set of organic compounds by means of principal component regression (PCR) and principal component-artificial neural network (PC-ANN) modeling. A genetic algorithm (GA) was employed as the factor selection procedure for both modeling methods, and the results were compared with two other factor selection methods, namely eigenvalue ranking (EV) and correlation ranking (CR).

Method. More than 1000 structural descriptors were calculated for each molecule using the Dragon software. The descriptor data matrix was subjected to principal component analysis, and the most significant principal components (PCs) were extracted. Multiple linear regression and an artificial neural network were employed for linear and nonlinear modeling, respectively, between the extracted PCs and E1/2. First, the PCs were ranked by decreasing eigenvalue and entered successively into each modeling method separately. Second, the factors were ranked by their correlation with the half-wave potentials (linear correlation for PCR, nonlinear correlation for PC-ANN) and entered into the models. Finally, the GA was employed to select the best set of factors for both models.

Results. The first 30 extracted PCs explained 96% of the variance in the descriptor data matrix. Among these, 10, 6 and 10 PCs were selected by EV, CR and GA, respectively, for PCR, while for the ANN model 7 PCs were selected by each of the factor selection procedures. The ANN models with EV, CR and GA factor selection explained 78.4%, 94.3% and 96% of the variance in the E1/2 data, respectively, whereas the corresponding values for the PCR procedures were 52.9%, 58.2% and 74.4%.

Conclusions. Factor selection by correlation ranking and genetic algorithm gives superior results to eigenvalue ranking. This confirms that the magnitude of a PC's eigenvalue is not necessarily a measure of its significance in calibration. Moreover, for the PCR method the results obtained by GA differ markedly from those obtained by the EV and CR procedures, whereas for the PC-ANN model the GA and CR factor selection methods give results close to each other.
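The difference between eigenvalue ranking and correlation ranking in PCR can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the function names (`pca_scores`, `rank_by_eigenvalue`, `rank_by_correlation`, `pcr_r2`) are hypothetical, the response is deliberately constructed to depend on a low-variance PC, and the GA and ANN steps are omitted.

```python
import numpy as np

def pca_scores(X, n_components):
    """Autoscale X, then return PC scores and eigenvalues via SVD."""
    Xc = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    scores = U[:, :n_components] * s[:n_components]
    eigvals = s[:n_components] ** 2 / (X.shape[0] - 1)
    return scores, eigvals

def rank_by_eigenvalue(eigvals):
    """EV ranking: PCs ordered by decreasing eigenvalue."""
    return np.argsort(eigvals)[::-1]

def rank_by_correlation(scores, y):
    """CR ranking: PCs ordered by decreasing |linear correlation| with y."""
    r = np.array([abs(np.corrcoef(scores[:, j], y)[0, 1])
                  for j in range(scores.shape[1])])
    return np.argsort(r)[::-1]

def pcr_r2(scores, y, selected):
    """Fit least squares on the selected PCs; return the calibration R^2."""
    A = np.column_stack([np.ones(len(y)), scores[:, selected]])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return 1.0 - (resid @ resid) / np.sum((y - y.mean()) ** 2)

# Synthetic stand-in for a descriptor matrix (60 molecules, 200 descriptors).
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 200))
scores, eigvals = pca_scores(X, n_components=30)

# Assumption for illustration: the response depends on a low-variance PC.
z = scores[:, 25] / scores[:, 25].std()
y = 2.0 * z + rng.normal(scale=0.3, size=60)

ev_order = rank_by_eigenvalue(eigvals)
cr_order = rank_by_correlation(scores, y)

n_keep = 5  # number of factors entered into each model
r2_ev = pcr_r2(scores, y, ev_order[:n_keep])
r2_cr = pcr_r2(scores, y, cr_order[:n_keep])
print(f"R^2 with EV ranking: {r2_ev:.3f}")
print(f"R^2 with CR ranking: {r2_cr:.3f}")
```

In this constructed case the informative PC has a small eigenvalue, so it is missed by the first five EV-ranked factors but picked up first by CR, which mirrors the abstract's point that eigenvalue magnitude need not reflect a factor's value in calibration.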
Similar resources
Informatics aided QSRR study of retention index of some volatile compounds
In the present work, an artificial neural network (ANN) model was used to study the quantitative structure retention relationship (QSRR) of retention index (RI) of some volatile compounds in natural cocoa and conched chocolate powder. Molecular structural descriptors are selected using genetic algorithm to construct the nonlinear QSRR models, kernel partial least squares PLS (KPLS) and Levenber...
Quantitative Structure-Property Relationship Modeling of the Redox Potential for Some Phenolic Antioxidants
In this work, quantitative structure-property relationship (QSPR) approaches were used to predict the redox potential of 42 phenolic antioxidants. The structures of all compounds were optimized by the AM1 semi-empirical method, and then a large number of molecular descriptors were calculated for each compound in the data set. Subsequently, stepwise multilinear regression was applied to select the mos...
Prediction of IC50 of 2,5-diaminobenzophenone organic derivatives using informatics-aided genetic algorithm
In the present paper, informatics-aided quantitative structure activity relationship (QSAR) models using genetic algorithm-partial least squares (GA-PLS), genetic algorithm-kernel partial least squares (KPLS), and Levenberg-Marquardt artificial neural network (LM ANN) approaches were constructed to assess the antimalarial activity (pIC50) of 2,5-diaminobenzophenone derivatives. Comparison of errors...
Quantitative structure-retention relationship analysis of nanoparticle compounds
Genetic algorithm and partial least squares (GA-PLS), kernel PLS (KPLS) and Levenberg-Marquardt artificial neural network (L-M ANN) techniques were used to investigate the correlation between retention time (RT) and descriptors for 15 nanoparticle compounds, which were obtained by the comprehensive two-dimensional gas chromatography system (GC x GC). Application of the dodecanethiol monolayer-protect...
QSAR Prediction of Half-Life, Nondimensional Effective Degradation Rate Constant and Effective Péclet Number of Volatile Organic Compounds
In this work, quantitative structure activity relationship models were developed to predict three bioenvironmental parameters of 28 volatile organic compounds, which are used in assessing the behavior of pollutants in soil. These parameters are the half-life, the nondimensional effective degradation rate constant and the effective Péclet number in two types of soil. The most effective descripto...